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1  .      INTRODUCTION 
1  . 1      RELIABILITY  GROWTH 

We  are  concerned  with  analyzing  a  particular  model  of  reliability- 
growth  .    The  "growth"  occurs  in  the  following  way:    a  system  has  some 
given  value  of  a  measure  of  reliability  at  the  beginning  of  a  length  of  time 
(i.e.,  at  the  start  of  a  test  period)  ,  and  at  the  end  of  this  period  the  value 
of  this  measure  has  changed  --  hopefully,  it  will  be  improved. 

This  change  may  be  caused  by  a  number  of  factors  .    We  shall  be  con- 
cerned, however,  with  only  those  factors  that  are  the  result  of  a  conscious 
effort  on  the  part  of  an  interested  observer  (the  "experimenter")  .    This 
effort  is  an  attempt  to  improve  or  correct  the  system  by  some  physical  ma- 
nipulation (such  as  component  replacement  or  adjustment)  or  perhaps  even 
by  possible  design  change.    The  model  considered  below  is  similar  to  many 
discussed  previously  in  the  literature  in  that  the  corrections  are  attempted 
only  after  system  failures  have  been  observed. 

A  comparison  between  the  model  considered  here  (and  its  implications) 
with  those  contained  in  the  literature  is  postponed  until  the  final  sections, 
where  the  differences  in  approach  should  become  more  apparent . 

At  this  point  we  shall  only  mention  the  sort  of  information  that  should 
be,  in  the  least,  the  content  of  any  analysis  of  reliability  growth.    This 
content  falls  into  two  categories:    inference  and  projection.    In  particular, 
an  analysis  should  be  able  to  produce  statements  (by  necessity,  probabil- 
istic ones)  ,  on  the  basis  of  the  model  and  the  failure  history  to  date, 
related  to: 
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Inference:    the  present  value  of  the  reliability 

Projection:    the  reliability  at  some  future  time,  with  or  without  con- 
tinued application  of  the  correction  ("growth")  process  . 
In  order  to  make  such  statements,  we  shall  first  discuss  two  basic  models 
which  allow  only  a  single  failure  mode  for  both  discretely  and  continuously 
failing  systems  .    This  condition  will  be  relaxed  in  adaier,  section  dealing 
with  systems  having  many  failure  modes  . 

A  final  comment  about  the  use  of  the  word  "system"  .  As  used  in  this 
paper,  it  shall  mean  simply  a  piece  of  equipment  that  has  an  assigned  task 
to  perform,,  If  it  does  not  perform  it,  it  is  said  to  have  "failed"  .  The  sys- 
tem can  be  very  simple,  containing  perhaps  only  one  component.  Or  it  can 
be  extremely  complicated  .  The  only  characteristic  we  shall  use  to  distin- 
guish between  those  degrees  of  complexity  is  the  number  of  different  (iden- 
tifiable) ways  it  can  stop  functioning:    i.e.,  the  number  of  failure  modes  . 

1.2      NOTATION 

The  following  notation  will  be  used  in  the  description  and  analysis  of 
the  model  discussed  above: 

.Capital  letters  stand  for  events  or  states  of  nature. 


.An  underlined  variable,  e.g.,    x,    is  a  random  variable. 


r    /    \  r         r  xi.  t  PrOD  .    fx    ^    X   ^    X  +   Ax] 

.f   (x)  =  p.d.f .  of  the  r  .v    x  =  lim * = * 

x  ~  Ax 

—  Ax  —  o 

.6(x)  =  Dirac  delta  function*  of    x . 


*Defined  most  conveniently  as  the  limit:     6(x)  =    lim  [h(x,£)]    where 

1         n  ^  6  —  o 

h(x,£)  =    {  fc 

0         otherwise 
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.P(A|B)  =  prob .  {event  A  given  event  B  has  occurred]  . 
.f   (x|A)  =   p.d.f.  of    x    given  A  has  occurred. 

prob.  fx  £  x  £  x+  dxlA] 

=     lim  : = ! — - 

Ax— o  Ax 

.E(x|A)    =       xf   (x|A)dx  =  conditional  expectation  of    x    given  A. 

r  2 

.V(x|A)    =       [x  -  E(x|A)]     f   (x|A)dx  =  conditional  variance  of    x 
given  A . 

.The  letter  H  will  be  used  to  denote  the  event  (state  of  nature)  "histori- 
cal experience":    all  the  prior  knowledge  that  is  available  concerning 
the  model,  values  of  parameters  of  the  model,  etc.    Probabilities  and 
p.d.f  .'s  conditioned  only  upon  H  are  called  "a  priori" ,  or  ;i  prior"  . 

.A  vector  is  noted  by  an  arrow  over  it,  with  the  vector  dimension  being 

indicated  in  parentheses  ,  e  .g  .  ,    t(n)  =   (tn  ,t.  ,t.  ,  .  .  .  t  )  . 

12     3  n 

2  .      THE  CONTINUOUS  MODEL 
2.1      DESCRIPTION 

The  system  has  a  single  failure  mode,  and  the  time  between  failures, 
t_,     is  a  random  variable  (r.v.)  with  probability  density  function  (p.d.f.) 

f  (t)    =  re~rt  0  £  t  £  *   . 

The  parameter    r    is  commonly  called  the  failure  rate  of  the  system  (or, 
more  properly,  of  the  particular  mode  of  failure)  .    Since  all  relevant  meas- 
ures of  reliability  for  an  exponentially  failing  system  can  be  obtained  from 
the  failure  rate,  it  will  be  sufficient  to  concentrate  upon  its  characteristics 
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only.    The  exponential  function  is  not  as  restrictive  as  it  may  seem  at  first. 
Although  it  is  certainly  a  simplistic  assumption  to  make  about  complex  sys- 
tems ,  it  becomes  more  valid  as  the  systems  become  more  elementary  and 
serve  to  comprise  the  components  of  an  even  greater  system.    In  addition, 
a  conceptually  simple  (but  laborious)  extension  of  all  the  results  of  this 
paper  is  possible  when  it  is  postu. ~ted  that    r    is  in  fact  a  function  of  time 
since  last  failure  . 

The  system  is,  at  any  time,  in  one  of  two  possible  states  (again,  with 
respect  to  a  single  failure  mode): 
U  =  Unrepaired  State 
R  =  Repaired  State 

The  numerical  value  of  the  failure  rate    r    depends  upon  which  state  the 
system  is  in: 

If  the  system  is  in  the  unrepaired  state  U ,  then    r  =    \  ; 
If  the  system  is  in  the  repaired  state  R,  then    r  =  p,   . 

The  numbers     X    and    y,    can  be  any  non-negative  values  ,  and  in  fact 
y    .is.: often   zero  .    On  the  other  hand,  the  value  of    y    might  not  he  zero . 
Thus,  although  the  system  is  said  to  be  "repaired"  ,  it  might  still  exhibit 
failures  ,  albeit  the  failure  rate  when  repaired  might  be  quite  low  . 

By  virtue  of  a  test  program,  the  system  changes  states  in  the  following 
restrictive  way,    After  every  failure,  if  the  system  is  in  U  it    l)  goes  to  R 
with  probability    a  (the  "repair  probability");  or    2)  remains  in  U  with  prob- 
ability   (1-a)  .    If  the  system  is  in  R,  it  remains  in  R  with  probability  one. 
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Thus,  there  can  be  only  one  transition  to  state  R;  once  the  system  is  re- 
paired, it  remains  so. 

This  repair  attempt  happens  instantaneously,  after  which  the  system 
operates  until  the  time  of  the  next  failure  (this  time  being  again  a  random 
variable  with  failure  rate  depending  upon  whether  the  system  has  been  put 
into  state  R  or  has  remained  in  state  U)  . 

The  model  may  be  represented  by  a  two-state  Markov  process,  as 
shown  by  the  flow  diagram  of  Figure  1  .    The  times  between  the  transitions 
indicated  in  the  diagram  are  the  times  between  failures  and,  thus,  are  con- 
trolled by  the  failure  rate  of  whichever  state  the  system  is  in: 


FIGURE  1 

Flow  diagram  representation  of  growth  model 
U  =  Unrepaired  state  (failure  rate  -    X) 
R  =  Repaired  state  (failure  rate  =  \x ) 
a  =  repair  probability 
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Which  state  the  system  is  in,  i.e.,  whether  or  not  it  has  yet  been  re- 
paired, is  unknown  to  the  observer,  and  he  can  draw  conclusions  as  to 
whether  or  not  the  system  is  repaired  only  by  observing  the  basic  data:    the 
successive  failure  times  (or,  equivalently,  the  times  between  failures)  . 

Finally,  it  is  possible  to  allow  for  the  system  to  start  off  in  a  repaired 

state  by  assigning 

p     =  prob .  (system  is  in  R  at  the  start  of  the  test)  . 

Except  for  one  situation  to  be  considered  later,  however,  we  shall  always 

assume  that    p     =  0  . 
o 

In  the  above  model,  it  is  easy  to  see  that  since  the  system  ultimately* 
will  go  to  state  R,  if    ^  <   X       the  failure  rate  of  the  system  will  eventually 
decrease,  and  thus  the  reliability  will  grow.    On  the  other  hand,  if  (for 
some  unforeseen  reason)    ^  >   X  ,    it  is  possible  to  degrade  the  system  re- 
liability by  such  a  test  routine  . 

2  .2      SOME  BAYESIAN  CONSIDERATIONS 

If  the  numerical  values  of  the  parameters    a,    jj,    and    X  ,  defined  above, 
are  known,  then,  as  will  be  shown,  it  becomes  a  straightforward  problem  to 
make  probabilistic  statements  about  the  failure  rate    r,    at  any  time,  on  the 
basis  of  any  amount  of  failure  information.    This  is  essentially  because  the 
value  of    r    depends  only  upon  the  state  of  nature  (U  or  R)  ,  and  the  transition 
from  U  to  R  is  the  extremely  simple  process  shown  in  Figure  1  .    If  the  values 


*As  long  as    a  ^  0  . 
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of  these  parameters  are  unknown,  however,  then  various  methods  must  be 
used  in  order  to  obtain  estimates  of  them  and  then,  in  turn,  to  make  state- 
ments about    r.    This  quest  is,  of  course,  within  the  purview  of  classical 
statistics,  and  much  has  been  written  concerning  the  estimation  of  param- 
eters of  models  similar  to  the  one  treated  here  and  associated  confidence 
intervals  (see  for  example  [1]  )  . 

The  classical  approach  is,  in  essence,  to    1)  define  some  estimator 
(of    r    in  this  case)  ,  examine  it  for  unbiasedness  ,  efficiency,  sufficiency, 
etc.;  and  then  to    2)  define  an  interval,  the  end  points  of  which  are  random 
variables  derived  from  the  observed  data,  which  will  contain  the  true  value 
of  the  parameter  with  some  pre -determined  probability. 

The  approach  we  choose  to  take  is  a  purely  inferential  one.    We  state 
that  before  any  experimentation  is  done  the  failure  rates  associated  with 
states  U  and  R  are,  respectively,  the  random  variables    X    and    jj,  .     (The 
sampling  process  associated  with  them,  if  one  finds  it  necessary  to  imagine 
such,  is  the  process  of  selecting  a  system  to  test  from  a  batch  of  systems, 
the  resultant  picked  system  having  associated  failure  rates  that  are  thus 
random  variables  selected  from  the  population  consisting  of  all  possible 
systems  to  be  tested  .) 

We  shall  also  assume  that  the  repair  probability    a    is  known.     (An  ob- 
vious extension  of  the  model  results  if    a    is  also  assumed  to  be  a  random 
variable  .) 
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The  joint  probability  density  function  of  the  random  variables    X.    and    p,  , 
before  experimentation  begins,  must  be  given,  and  it  is  assumed  that  this  is 
in  fact  known.    This  (most  likely  subjective)  prior  density  function  is  defined 
to  be 

f,     U,n|H). 

After  some  experimentation  and  possible  correction  has  gone  on  and  a 

series  of  failure  times    t(n)  =   (tn  ,t_,  ...,  t  )    has  been  noted,  then  use  of 

12  n 

the  definition  of  conditional  probability  allows  one  to  determine  the 
"posteriori"  density  function  . 

f      (X  ,n|H,T(n)  )  . 

Ajj, 

Since  the  failure  rate  of  the  system  at  any  time  is  a  function  of  both    X.    and 
Ij,  ,    it  is  itself  a  random  variable    r_,    with  its  own  conditional  p.d.f . 

The  purpose  of  this  study  is  to  in  fact  determine  this  density  function 
for  x,  both  at  the  outset  of  a  test  period  and  as  a  function  of  a  given  set 
of  subsequent  failure  times  .    In  addition,  we  shall  make  statements  con- 
cerning the  density  function,  and  its  moments,  for  the  failure  rate  jc_   at  any 
given  time  in  the  future  . 

2  . 3      KNOWN    X    AND    »  :    RELIABILITY  PROJECTION 

Let  us  first  suppose  that    X    and    ^    are  deterministic  and  their  exact 
numerical  values  are  known.    The  failure  rate    r_    is  still  a  random  variable, 
however,  since  it  depends  upon  whether  the  state  of  nature  is  U  or  R,  and 
that  is  itself  probabilistically  determined.    The  p.d.f.  for  jr_    is  easily 
determined  . 


With  a  total  test  time  of    t,    the  p.d.f.  for    r_  is    f  (r;f) 


r 


f  (r;r)    =   6(r-X)P(U  )    +   6(r-y,)P(R    )  (1) 

r  t  t 


where 


P(U    )    =  prob  .  [system  is  in  U  after  total  test  time     t] 

T 

P(R   )    =  prob.  f system  is  in  R  after  total  test  time    t] 
t 

The  delta  function  notation  is  used  as  a  convenient  way  to  write  a  p.d  .f . 

for  the  (at  this  point)  discrete  random  variable   _r . 

In  what  follows  we  assume  that  the  system  starts  out  in  the  unrepaired 

state  P,  so  that    p     =  0  .     (The  development  can  be  easily  extended  when 

p     ^  0,     and  this  will  be  done  in  a  later  section,  where  the  start  of  the 
o 

corrective  testing  period,    t  =  0,    occurs  after  some  previous  amount  of 
testing  .) 

In  order  to  calculate    P(U    )  =  1  -  P(R   )  ,    we  note  that  the  event    (U    ) 

T  T  T 

can  be  decomposed  into  a  union  of  the  mutually  exclusive  events    (U     ,F.) 

T         1 


where 


(F.)    =  event  {the  transition  from  U  to  R  takes  place  on  the  i      failure} 


so  that 


(UT)    =U(UT,F.).  (2) 


i=l 


Since  the    F.    are  mutually  exclusive  events,  we  have 


P(U    )    =     I  P(U     ,F.)    =     I  P(U     |F.)P(F.)  (3) 

T  .      ,  T         1  .      ,  T   '      1  1 

1=1  1=1 
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The  number  of  the  failure  at  which  the  transition  from  U  to  R  takes  place  is 
geometrically  distributed  with  parameter    a,     so  that 

P(F.)    =  a(l  -a)1"1    .  (4) 

Furthermore,  we  see  that 

P(U    |F.)  =  prob.  [system  is  in  U  at    t    given  it  goes  to  R  at  i      failure} 

=  prob.  {less  than    i    failures  in  time    t    while  in  U] 

.^^Ilie-XT  (5) 

j=0      J  " 

which  all  combine,  to  give 

P(U    )    =     Z       Z     -^-e"XTa(l  -a)1"1  (6) 

T         i=l    j=0     J  " 

Changing  the  order  of  the  summation  gives 

P(U  )    =     Z         Z     ^4-  e"XTa(l  -a)1"1 
T         j=0    i=j  +  l     J  ' 

00  ] 

-     (Xt)        -Xt,.         J  -aXT  /„x 

=     Z    -j-t—    e        (1  -  a)     =  e  (7) 

j=0     J° 

This  result  can  be  verified  by  noting  that  the  rate  of  transition  from  U  to  R 
is    aX  ,     since 

prob,  {transition  from  U  to  R  in    At} 

=  prob .  {failure  in    At|U}      prob.  {repair} 

=    X  A  Ta 
and,  thus,  the  probability  of  no  transition  in  time    t    is,  from  the  Poisson 
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distribution,     e   '        .    The  longer  derivation  is  useful,  however,  in  that  it 
indicates  a  technique  to  be  used  again  below  . 

The  above  equations  thus  show  that  the  p.d.f.  of  the  failure  rate  _r    at 
time    t    after  start  of  testing  is 

fr(r;T)    =   6(r-X)e"aXT+   6(r-^)   (1  -  e~aXT)  (8) 

Note  that  this  expression  reflects  a  probability  statement  made  before 
the  process  starts  .    In  other  words  ,  we  can  interpret  the  quantities 

00 

E(h;t)    =         rf  (r;-r)    =   \e  +  y,  (1  -  e  ) 

o     — 

=  n  +   (\~»)e~aXT  (9) 

and 

V(i)'t)    =    f"[r  -E(x;T)]2f  (r;x) 
Jo  - 

=   (X-M,)2e"aXT(l  -e"aXT)  (10) 

to  be  the  present  projection  of  what  the  mean  and  variance  of  the  failure  rate 
_r   will  be  at  time    t    (in  the  future)  after  corrective  testing  . 

These  projections  are  useful  in  themselves  as  aids  to  reliability  pre- 
diction.   That  is,  if  we  know  the  values  of  the  unrepaired  and  repaired 
failure  rates  and  the  value  of  the  repair  probability    a,  then  equation  (9) 
gives  an  estimate*  of  what  the  reliability  will  be  at  some  time    t    after 
testing  begins,  and  equation  (10)   (actually,  the  square  root  of    V(r,"r)  )    gives 
an  indication  of  the  preciseness  of  that  estimate  .    The  behavior  of  these 


* 

Optimal  (i.e.,  cost-minimizing)  for  a  quadratic  loss  function. 
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quantities  satisfy  intuition:    the  expectation  of  the  failure  rate  starts  off  at 

X    and  approaches    jj,  „    The  variance  starts  at  zero  (we  know  r  =  X    at    t  =  0)  , 

and  returns  to  zero  as     t  —  «    (r   will  certainly  be  equal  to    y,    by  that  time, 

as  long  as    a  /0),  with  an  interesting  maximum  occurring  at    t  =  ~~  . 

a  A 

2.4      KNOWN    X    AND    ^    RELIABILITY  INFERENCE 

All  of  the  above  analysis  has  been  made  under  the  consideration  that 

the  test  was  yet  to  be  done  „    The  analysis  is  extended  now  to  the  situation 

where  testing  has  been  going  on  for  a  time    t,    and    n    failures  have  been 

observed  at  times    t,  ,  t„ ,    .  .  . .  t     =  t(n)  .  where    t     ^   t  <  t      ,  .    (For  ease 

1       2  n  n  n+1 

in  notation  we  shall  now  let    t   =  t(n)  ,    with  the  understanding  that  the  vec- 
tor is  of  dimension    n  „) 

Again,  assuming  still  that    ^    and    X    are  deterministic  and  known,  we 
would  like  to  calculate  the  appropriate  conditional  p.d.f .  for  the  failure 
rate:    f  (r|  t  ,   t)  .    To  do  so  we  shall  need  to  calculate    P(R    |  t  )  .    This  is 
shown  by  extending  equation  (1)  of  the  preceding  section, 

f  (r|T;  t)  =   6(r-X)P(U    |7)  +   6(r-p,)P(R    |7)  (11) 

_r_    '  t  t 

We  again  make  use  of  the  events    F.    to  write 

i 

00 

P(U    I  t  )    =     E    P(U     ,F.  |7) 

T  .  T         1  ' 

1=1 


=   £  p(u  If.  ,  t )  p(f.  1 1  )  (12) 

1-1       T    x  x 
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But  now  we  see  that 

P(U    |F.  ,  t  )  =  prob .  [the  system  is  in  U  at    t    given  it  goes  to  R  at 

the  i      failure,  and  failures  are  observed  at 

V42 ln    ^   'nS  T<Vll 

'0       if  i  £  n 

1        if   i  >  n  (13) 

so  that  equation  (12)  becomes 

00 

P(U    I  t  )    =       L     P(F.|T)  .  (14) 

i=n+l 

Using  Bayes1  rule 

P(T|F.)P(F.)  P(T|F.)a(l-a)i_1 

P(F  |  t  )    =    i '-   =   "—^ (15) 

P(  t  )  P(  t  ) 

Under  the  condition  that    i  >  n    (i.e.,  for  all  terms  in  the  sum  in  equation  (14)  ), 

and  in  fact  the  i      failure  is  observed  to  lie  between    t.    and    t.  +  dt. 

l  11 

-\t         -X(t   -t  )  .     -\(t   -t   _  )    -x(T-t  ) 

P(7|F.)=\e       \e        Z         ..iXe        n     n       e  n  dtdt„ .  .  .  dt     (16) 

1    i  1     2  n 

=  \VXTdT  d7) 

since  the  times  between  the  first    n    failures,  given  that  transition  to  R 
occurs  at  some  failure  after  the  n     ,  are  identically  distributed  exponential 
r.v.'s  with  common  parameter    X.    The  last  term  in  equation  (17)  ,     e  n   , 

is  due  to  the  fact  that  no  failures  are  observed  in  the  interval    (t    ,   t)  . 
Combining  this  result  with  equations  (14)  and  (15)  yields 
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»  n   -Xt   ,_       xi-l  ,-* 

p(u  |T)  =     e      Xe      a(i~a)     dt 

T  i=n+l  P(  t ) 

_   X  e        (1  -a)    d  t  ^ 

p(T) 

We  now  turn  our  attention  to  calculating    P(R   |  t )    in  much  the  same  fashion: 


P(R    |T)    =     L    P(R    ,F.|  t) 

T1  .      ,  T         1  ' 

1=1 


=     L    P(R    IF.  ,  t)  P(F.|  t)  (19) 

.      n  T1      1  1  ' 

1=1 

Here  we  see  that 

'l       if    i  £  n 
P(R   |F    ,  t)    =1  (20) 

70       if    i  >  n 


so  that 


n 
P(R    |T)    =     E    P(F.|  t) 

T1  ,      ,  1 ' 

1=1 


- 


=     S    ; =    — (21) 


n     P(T|F.)P(F.)  2  /(tlF^ad-a) 

i=l  P(T)  P(T) 

By  the  same  arguments  that  lead  to  equation  (17)  we  find  that,  when    i  £  n 


-xt      -x(tft  )  -x^-tj.-i)     -,(t.+1-t.) 

P(t  |F. )  =    Xe  Xe  .  .  .  Xe  y,  e 

.  o  o  u,  e  . . .  e  dt,dt„  .  .  .  dt 

p  1     2  n 

i    "^   n_i    "n(T-tJ   _ 
=    Xe       V       e  dt  (22) 
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Using  this  in  equation  (9)  gives 

n      .    -Xt.         .     -u(T-t.)  .    . 

_     1         1    n-i      **        i      ,_       .1-1    -* 
L  X  e         u,        e  a ( 1  -  a)        dt 

P(R    |7)    =    — (23) 

P(t) 

In  order  to  evaluate    P(t  )  ,    the  common  denominator  in  equations'  (18) 
and  (23)  ,  we  finally  note  that  since    (R  )     and    (U  )    are  exhaustive  and 

T  T 

mutually  exclusive 

P(R    1 7)    +  P(U    1 7)    =    1 

which,  by  use  of  equations  (18)  and  (23)  gives 

p(u  1 7)  =  l-  p(r  1 7) 

T1  T1 


(24) 


n    -Xt..        .n 
X    e        ( 1  -  a) 

L(~t  ;  X  ,  p.) 

where  the  function    L(t  ;  X  ,  n)    is  defined  to  be 

n         -Xt.  ~u(T-t.) 

t/T.  i       \    -     t-  yi  i    n-i      *"         i ,.       ,i-l         n    -Xt,.       ,n 

L(t  ;  X  ,  (j.)    =     T  X  e         y,        e  a(l  -a)         +   x    e        (1  -a) 

i=l 

=    P(7)/d7  (2  5) 

Combining  all  this  with  equation  (11)  gives,  for  the  density  function  of  the 

failure  rate    r  ,    having  observed  failures  at    t,  .  t„ .    . ...  t      during  a  test 
_  =  1       2  n 

period  of  length    t: 

n      .    -Xt.  -|i(T-t.) 

6(r-n)  Z  x'e       Vn_1e  x  a(l  -  a)1_i  +  6(r  -  X)  Xne    ^T(l-a)n 

fr(r|7;  t)    =    *=* (26) 

—  L(  t ;  X  ,  |j, ) 
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Equations  (24)  ,  (25)  ,  and  (26)  are  the  only  ones  necessary  to  make 

inferential  statements  about  the  reliability  at  time    t  ,  given  failures  at 

times    t,  ,  t„  .    .  . . ,  t    ,    and  given  the  values  of    X  ,  u   and    a  . 
1       2  n  ^ 

For  example,  let  us  suppose  that    jj,  =  0    (a  repaired  system  never 
fails)  o    Since 


00 

E(j_|T;  t)    =         r  fr(r|7;  T) 


dr 


we  find  that 


-a-t,,       v  ,  n 


E(r  | T;  r)   -        _£        ^^  =  te  (27) 

n   ,   M       \    ~^T  a  n 

ae  +  (1  -a)e +   Xe 

1-a 


and 


-x(T-tn) 

P(UT|t)-l-P(R|t)=  e  (28) 

a  n 

- —  +  e 
1-a 

In  this  case  it  becomes  apparent  that  inferential  statements  can  be  made 

with  only  the  information  consisting  of  the  length  of  time  since  the  last 

failure    (T-t    )  .    This,  of  course,  is  intuitively  clear,  since,  if    p,  =  0 ,  at 

the  time  of  the  last  failure  the  system  couldn't  possibly  have  been  repaired. 

2o5     UNKNOWN    X    AND    n :    RELIABILITY  PROJECTION 

We  come  now  to  the  more  interesting  and  practical  situation:    that 
where  the  parameters     X    and    \i    of  the  process  are  unknown  at  the  start  of 
the  testing.    Inferential  statements  about  the  values  of  these  will  come  in 
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the  next  section  .    Here  we  will  be  concerned  with  only  deriving  predictive 
statements  analagous  to  those  implied  by  equations  (9)  and  (10)  . 

The  basic  technique  used  here  is  to  simply  consider    X    and    p.    to  be 
random  variables    _X    and    £_,    with  respective  p  .d  .f.'s    f  (X|H)    and 
f  (M<|H) ,  or  possibly    ajointp.d.f.    f     (X#M,|H).    These  a  priori  density 
functions  are,  at  least  at  the  start  of  experimentation,  most  probably  sub- 
jective ones  .    That  is,  they  represent  all  information  available,  at  the  time, 
relevant  to  the  failure  rates  in  question  and  expressed  in  terms  of  an  appro- 
priate density  function*  .    If  some  quantitative  information  is  available, 
from  previous  tests,  etc.,  then  of  course  these  density  functions  should  be 
conditioned  not  only  upon  the  event  H,  but  all  other  observed  relevant  data  . 

As  a  first  step,  we  re-write  equation  (8)  with  the  notation  expanded  to 

emphasize  the  fact  that    X.    and    \i_   are,  in  that  equation,  deterministic  and 

have  known  values    X    and    m-  ,  respectively.    In  other  words , 

f(r;T,X/|J.)    =   f(r,-T,_X=  X,yL=n,) 
JL  JL 

so  that 

fr(r;T,X,n)    =    6(r-X)e"aXT+   6(r-n)   (l-e"aXT)  (29) 

We  now  use  the  well-known  fact  that  for  any  probability  that  is  itself 
conditioned  so  that  it  is  a  function  of  a  realization  of  a  r  .v . ,  i.e., 


*The  best  techniques  for  producing  such  subjective  functions  are,  and  will 
probably  always  be,  subject  to  a  great  deal  of  controversy.    We  side-step 
these  philosophical  issues  here.    The  interested  reader  is  referred  to  the 
copious  literature  on  the  subject,  for  example  [7]  . 
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P(A|x  =  x)  ,    the  unconditioned  probability  is  simply  the  expectation  of  the 
conditioned  one ,  i.e., 

00 

P(A)    =     '  P(A|x=  x)  f  (x)dx*  (30) 

j  x 

-co 

Using  this  relation,  we  may  write  in  place  of  equation  (8) 

CO  CO 

fr(r;r)    =    J*       '  fr(r;T,X,n)fXMj(\/Ml|H)d\dn    . 
0       0~ 

In  all  that  follows  we  shall  assume  that   _X    and   \x_   are  independent, 
for  ease  of  notation,  so  that  we  may  write 

The  discussion,  however,  can  be  easily  extended  to  the  case  when  they  are 
dependent  variables  „    We  shall,  for  convenience,  also  drop  the  conditioning 
event  H,  since  all  statements  that  can  be  made  are  all  eventually  conditioned 
upon  prior  experience  „ 

Performing  the  indicated  integration,  we  find 

CO  CO 

fr(r,"r)  =  <        '  *[6(r-\)e"aXT+  6(r-„)  (1  -  e~aXT)  ]  f^X)  f  (m)  dX  d^ 

0      0 

=    f^(r)e"arT+  f   (r)  J   (l-e"a§T)fx(§)  d?  (31) 

0  ~ 

from  which  we  may  derive 

00  00 

E(r;T)    =     r?fx(?)e"a?Td?+  E(t±)  J  (1  -  e"a§  T)  f^(  §)  d§  (32) 


*For  example,  see  Parzen  [11]  p.  336 


-  18  - 


An  expression  for    V(r,*T)    may  also  be  derived,  but  the  specific  form  is 

complicated  and  does  not  provide  any  easy  interpretation  . 

As  an  example  of  the  use  of  equation  (32)  ,  consider  the  case  where, 

again,    ^    is  known  and  is  in  fact  equal  feero  (or,  equivalently,  it  is  a  r. v. 

with  p.d.f.    f  (m)    =   6(|i)  )  .    Then    E(i-;t)    becomes,  from  (32) 
ii 

E(r;r)    =    J   5  f^)e"a§Td?  (33) 

0 

The  behavior  of  this  expected  value  of  failure  rate  at  a  time    t    into  the 
future  (under  the  corrective  test  program)  can  be  explored  by  selecting  an 
appropriate  form  for  the  prior  p.d.f .  on    K  .    For  convenience,  we  select 
for  this  prior  density  function  the  conjugate  form  [  12]  gamma  distribution 

fx(X)    =    I   -ta'  (34) 

(  0  otherwise 

which  has  the  moments 

BCD  -  f 
v(A)  =  Jx 

This  distribution  thus  has  enough  freedom  for  the  fitting  of  a  desired  mean 
and  variance  by  appropriate  selection  of  the  constants    a   and    3. 
Putting  equation  (34)  into  (32)  yields 
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=    E(X)  (1+   f)-(a+1) 

2.6      UNKNOWN     X    AND    p,  :     RELIABILITY  INFERENCE 

The  problem  of  inferring  the  value  of   r.  after  the  observation  of  a  data 
vector    t  =  t(n)    is,  of  course,  complicated  by  the  fact  that  now    \    and    y^ 
are  also  random  variables:    A  complete  solution  must  also  make  inferential 
statements  about  the  posterior  distributions  for  these  rates  as  well  as  for   r_. 

These  statements,  via  the  appropriate  posterior  density  functions,  may 

be  easily  made,  however,  by  the  judicial  use  of  equation  (30)  .    For  example, 

we  note  that  equation  (24)  now  should  be  written 

n   -Xt,_       ,n 
e        (1-a, 

L(7;   X  tV) 

The  unconditional  probability  that  the  system  is  still  in  the  unrepaired  state 
becomes,  using  Bayes'  Rule  twice,  and  all  limits  of  integration  from  0  to  so  . 


n   -at,         ,n 
P(U    |T;  X  =  X  ,  j*  =  \i)    =   ^ U-=^J-  (35) 


P(UT|t)   =  P(Ut|  t;  X  =  X,j±  =  u-)f     (X  ,  p.  |  tjdXdn 


P(U    I  t;  X  =  X  ,  Ml=  h) 


AJJ. 
L(T;X,pt)fx   (X,,j,)  dXdn 


J    Jl(T;  X,n)fx   (X,m.)  dXdn 


P(Ut  ,  T|X=  X  ,i±.=  p.)  f^(X  ,  n)  dX  dn 


L(t;X/Ml)  fXpi(XM,)  dXdp, 
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f    f    Kne"XT(l-a)nf,,(X/n)  dXdn 

=  TT~^ = (36) 

J         L(t;X,u.)f     (X  ,n)  dX  dp, 

In  addition,     P(R    I  t)    may  be  obtained  by  noting  that 
t  ' 

=    1  -  P(R    |7)  (37) 

Similarly,  it  may  be  shown  that  the  appropriate  posterior  density  func- 
tions for  the  rates    X    and  jy.    are 
P(T|X  =  \)fx(\) 


f.(X|t;T)   - 


Jp(T|x=  x)f (\)  dx 

f  L(T;p.  ,  k)  MX)  f M  dp. 

J— 1 — t {38) 

I  L(t;^  ,  X)fx(X)f   (n)dXdn 


and 


_              f  L(7;n  ,x)fx(X)f  (p.)  dX 
f>|t;T)  =  ■L-— ; = B. (39) 

*  I    )  L(t;n  ,X)fx(X)f   y  dXdpL 

where  we  have  let    f,     (X  ,  |i)    =  f  (X)  f  (u)    for  ease  of  notation. 

XU  X  \± 

Finally,  the  same  sort  of  manipulation  leads  to 

n      .    -Xf,        ,     -r(T-t.)  .    , 

p    _     i         l  n-1  r     l      ,,      .1-1,   /    v   ,  n   -rT,,      .n 

LXe         r        e  a(l-a)        f  (X)  dx  +  r  e       (1-a) 

fr(r|T;T)   =   -^ (40) 

L  j  L(T;X,M.)  fx(X)  f^p.)  dX  d  n 

Although  these  equations  seem  formidable,  they  are  extremely  useful 
and  valuable  and  provide  all  the  information  necessary  for  inferential  state- 
ments about  the  system  reliability,  given  an  observed  set  of  failure  times  . 
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In  particular,  knowledge  of  the  expected  values  of  the  random  variables 
\ ,,  ±l   and    x_ ,  given    t  ,    gives  the  experimenter  good  estimates  of  the  value 
of 

a)  the  failure  rate  before  testing  began:    equation  (38) 

b)  the  eventual  value  of  the  failure  rate  after  unlimited  correctional 
testing:    equation  (39) 

c)  the  present  value  of  the  failure  rate:    equation  (40) 
Additionally,  the  probability    P(R   I  t  )    that  the  system  has  in  fact  been  re- 

T1 

paired  is  given  directly  by  equation  (37)  . 

As  is  common  in  all  Bayesian  inference  schemes ,  the  foregoing  develop- 
ment is  liable,  with  some  justification,  to  the  criticism  that  the  results  are 
dependent  upon  the  particular  prior  distributions  used:    f  (X)    and    f  (M-)  . 
This  is  indeed  so,  but  the  real  concern  should  be  with  the  sensitivity  of  the 
results  to  variations  and/or  extremes  in  the  selection  of  prior  functions  .    In 
particular,  it  is  certainly  possible  to  select  the  prior  distributions  with  suf- 
ficiently large  variances,  so  that  the  result  of  the  analysis  becomes  rela- 
tively independent  of  the  prior  expectations  . 

On  the  other  hand,  if  the  failure  rates  in  question  are  to  any  degree 
known  in  advance,  it  seems  unreasonable  not  to  allow  the  analyst  to  make 
use  of  his  knowledge  —  particularly  for  the  making  of  projections  . 
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3  .      THE  DISCRETE  MODEL 

3.1      MODEL  DESCRIPTION 

A  model  similar  to  the  one  discussed  above  is  now  developed  for  the 
case  where  a  system  exhibits  "discrete"  failure  behavior.    That  is,  the  sys- 
temiundergoes  "trials"  ,  and  at  each  trial  the  system  either  succeeds  or  fails 
We  assume  that  these  trials  are  independent  (the  equivalent  of  the  assump- 
tion of  exponential  behavior  for  the  continuous  model)  .    A  convenient  and 
appropriate  measure  of  reliability  of  the  system  at  any  time  is  simply 
p  =  1  -  q,    where 

p  =    probability    {success  on  the  next  trial} 
q  =    probability    {failure  on  the  next  trial} 
In  order  to  model  a  reliability  growth  effect,  we  again  consider  the  sys- 
tem to  start  in  state  U,  from  which  it  has  probability    a    of  making  a  transi- 
tion to  state  R  after  every  failure  .    We  then  define  the  probabilities 
u   =    probability    {system  fails  on  a  trial  given  in  state  U} 
v  =    probability    {system  fails  on  a  trial  given  in  state  R} 
The  analysis  now  proceeds  exactly  as  in  the  preceding  sections,  and 
requires  only  some  obvious  notational  changes  (to  account  for  the  discrete 
character  of  the  failure  data)  and  additions  . 
Let: 

x   =    fx,  ,  xn  ,    ...  x   }    =  the  observed  data  vector  after    n    trials, 
12  nJ 

where    x.  =  0  or  1    as  the  i      trial  results  in  a  failure  or 
i 

success,  respectively 
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y.   =      L  x.     (i  =  1 ,  2  ,    .  .  .  n)  =    the  cumulative  number  of  successes 
1        k=l   * 

up  to  and  including  the  i      trial 

z.  =  n  -  y.    =    the  cumulative  number  of  failures  up  to  and  including 

•  th      .    . 
the  i      trial 


3.2      KNOWN    u    AND    v:     RELIABILITY  PROJECTION 

We  first  consider  the  case  where  the  failure  probabilities  u  and  v  are 
deterministic  and  known.  At  the  end  of  N  trials,  the  system  failure  proba- 
bility is  the  random  variable    cj_ ,    with  p.d.f.    f  (q  ;  N)    given  by 

H. 

f   (q;N)    -    6(q-u)P(UN)  +    6(q-v)  PfR^)  (42) 

in  direct  analogy  with  equation  (1)  ,  where 

P(U    )    =    probability    {system  is  in  U  after  N  trials] 
P(R   )    =    probability    {system  is  in  R  after  N  trials] 

The  value  of    P(U    )    is  readily  calculated: 

N 
P(U    )    =    [probability    {system  not  repaired  after  one  trial]] 

=    [1  -  probability  {system  is  repaired  after  one  trial]] 

n  tN 

=    [  1  -  au] 

since  all  the  N  trials  are  in  the  U  state,  are  independent,  and  a  failure 

(with  probability    u)  is  necessary  before  a  repair  (probability    a)  is  made  . 

Equation  (42)  then  becomes 

f   (q;N)    =    6(q-u)(l-au)N  +   6(q-v)[l  -(l-au)N]  (43) 
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The  expectation  of  the  system  failure  probability  at  the  end  of  N  trials  is 

E(<i;N),    where 

1 
E(a;N)    =    J  q  f  (q;N)dq 


0 

=    u(l-au)N  +  v[l  -  (l-au)N] 

=    v  +   (u-v)(l-au)N  (44) 

3.3      KNOWN    u    AND    v:     RELIABILITY  INFERENCE 

In  order  to  make  inferential  statements  about  the  random  variable    £ 
(and  hence    pj  given  some  data  has  been  observed,  we  proceed  again  in  a 
fashion  similar  to  that  used  in  the  analysis  of  the  continuous  model.    In 
particular,  we  may  write  for  the  conditional  p.d.f .  of    g_ ,    given  the  ob- 
served failure  data  vector    x  : 

f   (q|£)    -    6(q-u)P(Ujx)  +   6(q-v)  P(Rj  x)  (45) 

By  defining  the  event  G. 

(G. )   =    event    {the  transition  from  state  U  to  state  R  takes  place 


i 


immediately  after  the  i      failure} 


we  may  first  of  all  write 


P(U    I  x)   =     L    P(U    ,  G.  I  x) 
n1  ..  ,  n        l1 

i=l 


L    P(U    |G.  ,x)P(G.|  x)  (46) 

.    ,  n'     i  i ■ 

i=l 
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since 

i=l 

The  definition  of    G.    allows  us  to  write 

1 

CO  i  £  z 

P(UjG.,x)=  " 

/   1  1  >    z 

k  n 

since    z      is  the  total  number  of  failures  observed  in  the  first    n    trials  . 
n 

Thus ,  if    i  £  z     ,    the  transition  from  U  to  R  has  taken  place  at  or  before 
n 

the  n    -  trial,  and  the  system  cannot  be  in  state  U  at  the  n      trial . 
Equation  (46)  can  now  be  written 


P(Ujx)    =        L      P(G.|x)  (47) 

i=z 

n+1 


and,  using  Bayes1  Rule 


P(Unix)    =   ^±^ 


2      P(x|G.)P(G.) 


P(x) 


The  value  of    P(G.)    is  determined  from  the  underlying  geometric  process 
with  parameter    a,     so  that 

i-1 


L      P(x  |G.)a(l-a) 


P(U|x)    =    a±j^  (48) 

P(xl 


-  26  - 


We  now  note  that  when  the  transition  from  U  to  R  takes  place  at  some  trial 

after  the  n      [i.e.,  for  all  terms  in  the  summation  in  equation  (48)  ]  ,  we 

may  write 

1-x  x      l~x9         x9  l~x  x 

P(x|G.)    =    u        V-u)       u        2(l-u      ...u        n(l-u)  n 

z  y 

n,       .    n 
=    u      (1-u) 

since  all    n    trials  take  place  while  the  system  is  in  the  U  state  .    Com- 
bining this  result  with  equation  (48)  gives 

oo  z  y 

L      u      (1-u)       a(l-a) 

P(U    I  x  )    =    ^^ 

n1 


P(x) 

z             y             z 
=   H (1-u)      (1-a) (4g) 

P(x) 

The  calculation  of    P(R   J  x  )    is  also  accomplished  by  use  of  the  ex- 
haustive and  exclusive  character  of  the  event    (G. )    1=1,2,    ...«. 

00 

P(R    I  x)    =     L  P(R    ,G.|  x) 

n1  .    ,       n      i' 

i=l 

00 

=     L  P(R   |G.  ,x)  P(G.  I  x)  (50) 

.    ,       n1     i  i  ' 

i=l 

The  value  of    P(R    |G.  ,x")    is  determined  by  the  same  arguments  that  led  to 
n  ■    i 

equation  (47): 

(l  i  *  z 

P(R JG.  ,x)    =  n  (51) 

n1    i  /_ 

(0  i  >   z 
v  n 
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so  that  equation  (50)  becomes 


n 


P(R    I  x)    =     L   P(G.|  x) 

n1  .    ..         1 ' 


and,  using  Bayes;'  Rule  and    P(G. )  =  a(l-a) 


i-1 


P(Rjx)    = 


n 

L   P(x  |G.)  a(l-a) 
i=l I 


i-1 


(52) 


where  the  summation  is  defined  to  be  zero  when    z     =  0  . 

n 


Finally,  we  note  that  when    i  £  z 


n 


1-x. 


Xl     1_X2  X2  1_Xi  Xi 

P(x|G.)  (1-u)       u        Z(l-u)       ,..u        '(1-u) 


1-x,    ,  x.    ,  1-x  X 

l+l..      x    i+l  n,       .    n 

v  ( 1  -v)  o  o  o  v  (1  -v) 


i-y.  y.    n-i-y  +y  Y~Y. 

i.        .    l  n      i.        .    n      l 

=    u         (1-u)      v  (1-v) 


z.  y.     z   -z.  y   -y. 

i,.      v    i      n      i.       ,    n      i 
=    u     (1-u)      v  (1-v) 


(53) 


so  that 


n     z.  y.     z    -z.  y   -y.  .    , 

£  u     (1-u)      v  (1-v)  a(l-a) 


P(Rn|x)  =  ^ 


(54) 


P(x) 

Complete  inferential  statements  about  the  failure  probability    q  ,    given  the 
observed  data    x  ,    may  now  be  readily  made  using  the  posterior  p.d.f . 
f   (q|  x  )  „    This  has  been  obtained,  essentially,  since  we  now  need  to 
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simply  substitute  the  expressions  for  P(U  |  x  )  and  P(R  |  x  )  (from  equa- 
tions (49)  and  (54)  ,  respectively)  into  equation  (45)  .  Note  that  the  common 
term  of  P(x  )  in  the  denominators  of  equations  (49)  and  (54)  can  be  evalu- 
ated by  means  of 

P(UN|x~)    +    P(Rjx~)    =    1 

3.4      UNKNOWN    u    AND    v:     RELIABILITY  PROJECTION 

When  the  failure  probabilities    u    and    v    are  unknown,  we  proceed  as 

in  section  2  .5  by  treating  these  parameters  as  random  variables    u_   and    v  , 

with  joint  p  .d  .f  .    f     (u,v)  =  f     (u,v|H).    Again,  we  shall  (for  ease  in 

uy  uv  ' 

development)  assume  that    u    and    v    are  independent,  so  that 

fuv(u'v>    =    £u(u)£v(v) 

Use  of  the  technique  illustrated  by  equation  (30)  gives  the  following 
results  .    (Intermediate  steps  have  been  left  out.    The  development  parallels 
that  of  section  2  .5) 

f   (q;N)  =    f     "  {6(q-u)(l-au)N+   6(q-v)  [  1  -  (l-au)N]  }  f      (u,v)dudv 


0     0 


=    (l-aq)Nf   (q)  +  f  (q)    f   [  1  -  (l-a.§)N]  f   (?)  d§ 


(55) 


The  projected  expectation  of  the  failure  probability  at  the  end  of  N  trials  is 

1 
E(q;N)    =    Jqf   (q;N)dq 


'0     * 


1  m...  -   .  r1. 


=      '  If  (?)(l-a§)Nd§+  E(^)     '   [l-(l-a§)]Nf  (S)d§  (56) 

Jo    -  Jo  - 
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3,5      UNKNOWN    u    AND    v:    RELIABILITY  INFERENCE 

When  a  data  vector    x     has  been  observed,  and    u    and    v    are  random 
variables  with  prior  p<,d„f  „    f      (u,v)  ,     conditional  density  functions  on    u  , 
v    and    g_    can  be  derived  in  a  manner  parallel  to  that  used  for  the  continuous 
case  in  section  2.6. 

To  keep  the  expressions  concise,  we  define  the  following  terms: 

z  y  z 

P(UN,x;u)    =    u   n(l-u)    n(l-a)    n  (57) 

z 

n     z.  y.    z   -  z.  y  -y. 

P(R    ,x;u,v)    =     L  u  ^l-u)    \  n      ^l-v)    n      ^(l-a)1  (58) 

•     n  1=1 

P(x*;u,v)    =    P(U    ,x*;u)  +  P(R    ,x";u,v)  (59) 

n  n 


1    1 
P(x) 


t 


P(x;u,v)f     (u,v)dudv  (60) 

0  — 


The  posterior  density  functions  of  interest  then  become  (after  intermediate 

steps  similar  to  those  in  section  2.6) 

1 
'  P(x;u,v)  f     (u,v)  dv 

f  (u|x)    =    •* (61) 

-  P(x) 

1 


f  P(x;u,v)  f     (u,v)  du 
jn  uv 


f   (v|  x)   =   -9 (62) 

~  P(x) 

1 


'  P(R    ,x*;u,q)f  (u)  du  +  P(TJ   ;x,q) 
i/i-N         n  u.  n. 


f   (q|x)    =    ~ (63) 

^  P(x) 
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and  the  posterior  probability  that  the  system  has  been  repaired  is 

1.1 


P(R    ,x;u,v)  f      (u,v)  dudv 
»L  tL         n  uv 


P(R    |  *)    =    "" "  (64) 

PU) 


4.      NUMERICAL  EXAMPLES 
4.1      CONTINUOUS  MODEL 

A  numerical  example  is  now  presented  to  illustrate  the  use  of  the 
results  of  the  previous  sections  . 

The  first  task  is  the  assignment  of  appropriate  prior  probability  density 
functions  for  the  failure  rates     X    (before  repair)  and    tL    (after  repair)  .    In 
order  to  facilitate  calculations  it  is  convenient  to  assume  that  these  random 
variables  are  independent  and  have  prior  density  functions  of  the  Gamma 
family,  so  that 

B  a  -1    -ex 

MX)    =    =£-,   X    J       e  (65) 

B  a  -1    in 

f   fe)    =   ^rf--   n  e  (66) 

Furthermore,  we  suppose  that  estimates  are  available  for  the  moments 

of    u    and    v.    A  particular  set  of  such  estimates  is 

E(\)  =   1  E(ii)    =    .5 

(67) 

a(X)  =    1  a(ji)    =    -5 
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J. 

here  E(_X)    =         X  f   (X)  dX  =    expected  value  of    X 


2 


1 


a"(A)  [X  -  E(A)]    MM  d^  =    variance  of    X 

This  set  of  estimates,  in  conjunction  with  equations  (65)  and  (66)  give 

Oj  =  i  «2  =  1 

Bl  =    1  B2   =    2 

The  repair  probability  is  assumed  known  and  to  have  value    a  =  .2  5 

These  figures  are  selected  not  with  a  physical  example  in  mind,  but 
with  the  intention  of  displaying  the  underlying  features  of  the  model.    Thus 
we  at  this  point  have  assumed  the  following., 

.  At  the  start  of  testing,  the  system  has  a  constant  failure  rate    X    that 
is  unknown,  but  is  estimated  to  be  about  1  (per  unit  time)  .    The  precision 
of  this  estimate  is  indicated  by  a  standard  deviation  of  1  (per  unit  time)  . 

.  After  every  failure  an  attempt  at  repair  is  made  .    This  attempt  has 
probability    a  =  .25    of  succeeding,  i.e.,  putting  the  system  in  the  "repaired" 
state  . 

.  When  the  system  has  been  repaired,  the  failure  rate  decreases  to  a 
constant  value    |i    which  is  unknown,  but  which  (from  experience  or  judicial 
guessing)  can  be  estimated  to  be   .5  (per  unit  time)  with  a  standard  deviation 
also  of  .5  (per  unit  time)  . 

We  now  proceed  to  make  statements  about:    the  failure  rate  after  some 
length  of  future  test  time  (projection);  updated  estimates  of    X    and    |i    on 
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the  basis  of  failure  data  gathered  during  the  experiment  (inference);  the  sys- 
tem failure  rate    r    after  observation  of  failure  data  . 
Projection: 

Using  the  values  given  above,  the  p.d.f .  for  the  failure  rate  _r    at  some 
time    t    after  the  start  of  the  growth  program  is,  from  equation  (31) 

-2r 
c,       v  -r(l+.25T)  .5Te  ,„. 

yrjT)  =  e  ♦  t*~3&  (68) 

and  so  the  expected  value  of  the  failure  rate  after  time    t    is  ,  from  (32) 

E(^)  = 'tt1^)2  +  irrfe  <69> 

From  this  expression  we  see  that  the  expected  failure  rate  will  drop 
halfway  between  its  unrepaired  and  repaired  values  after  a  length  of  approxi- 
mately   t  *   12    units  . 
Inference: 

In  order  to  make  inferential  statements  about    X  ,    \±    and  jr  ,    a  data 
vector  is  needed . 

Suppose  that  failures  are  observed,  after  the  start  of  testing,  at  times 
1,  2,  3,  4,  6.2,  8.2,   10. 2,  so  that    n  =  number  of  failures  =  7    and 

r=    (1,2,3,4,6.2,8.2,10.2) 
[This  data  vector  was  chosen  to  intentionally  —  and  crudely  —  simulate  a 
"repair"  at    t  =  4    and  a  decrease  in  failure  rate  from  1  to   .5] 

For  any  time    t,     equations  (38)  ,   (39)  and  (40)  give  the  p.d.f.  for    \, 
jj.    and  £/  respectively;  equation  (36)  gives  the  probability  that  the  system 
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has  been  repaired  at  or  before  that  time  .    In  our  numerical  example,  we  can 
examine  these  posterior  density  functions  by  finding  their  means  and  stand- 
ard deviations  .    For  the  prior  parameters  and  data  vector  given  above,  these 
have  been  calculated  and  are  shown  in  Table  1  for  values  of    t    from  0  to 
10  .2  by  increments  of    A  t  =    .2    time  units  . 
Projection  after  Inference: 

At  this  point  it  is  possible  to  extend  the  development  to  describe  the 
following  situation. 

Suppose  that  prior  parameters  have  been  selected, as  above,  and  the 
inferential  calculations  carried  out.    At  time    t  =   10  .2,  after  having  seen 
the  7  failures  described  by    t  ,    what  can  we  say  about  the  expectation  of 
the  failure  rate  at  some  time    t      after  time    t  =   10  .2? 

In  order  to  answer  this  question  we  note  that  at  time    t  =   10  .2    we 
have  (see  Table  1) 

E(X)    =     .917  E(yJ    =     .543 

ct(X)    =     °522  a  (Hi)    =     °322  (70) 

P(Ri2|T)  =     .846 

We  are  now  faced  with  the  situation  described  in  the  discussion  fol- 
lowing equation  (1)  .    For  we  may  consider  the  situation  to  be  such  that  the 
values  of  equation  (70)  describe  our  total  knowledge  about    X    and   jj.    up  to 
that  point;  i.e.,  they  can  serve  to  define  a  new  "prior"  density  function, 
with  parameters    a'  ,   B'     ot'    and    B  '  . 
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Doing  so,  we  find  that 


a'  =    1  .75 


P  '  =    1.92 


a  '  =    1.68 
B2'  =3.10 


In  addition,  we  now  have  the  situation  where  the  value  of 
p     =    prob    {system  is  in  R  at  time  0} 

=    pfR12lT}    =     '846 
A  simple  argument  leads  to  the  modification  of  equation  (8)  for  the  case 

when    p     ^  0: 

fr(r;T)    =   6(r-X)(l-po)e"iiXT+   6(r-p,)  [  l-(l-po)e"aXT  ]  (71) 

and,  consequently,  equation  (31)  becomes 


fr(r;r)    =   (l-po)fx(r)e~arT  +  f^(r)  J  [  l-(l-Po)e~a|T]fx(?)d§ 


(72) 


~      v0 


Taking  the  expectation  of  equation  (72)  ,  using  the  primed  prior  parameters, 


we  get 

E(r|tV) 


expected  value  of  failure  rate  time    t'    after    t  =  12, 
given    t 


=     .543   + 


1*1'  2    L 

.485(.72  -   .136  t') 
(1.92  +   .25t')2'75 


1-(1-P0) 


Pl 


a. 


i  _ 


P  '+a-r' 

11 
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Sensitivity: 

The  model  has  not  been  fully  evaluated  with  regard  to  the  sensitivity  of 
results  to  values  of  the  prior  parameters,  errors  in  estimation  of    a,     etc. 
However,  examples  for  various  cases  have  been  calculated. 

Tables  3  through  6  show    E(x)  ,  <?U)  ,  E(m)  ,  oM  ,  P(R   )  ,  E(r)  and  <j(r) 

T 

all  conditioned  upon  the  data  vector    t=   (1,2,3,4,6.2,8.2,10.2)    and  evalu- 
ated at    t  =  0    to  10  .2  by  increments  of    At  =    .2    time  units  .    These  calcu- 
lations contain  the  prior  parameters  as  shown  in  Table  2  . 


Table 

"l 

Bl 

a2 

h 

E(X) 

a(X) 

e(mO 

afrO 

a 

1 

1 

1 

1 

2 

1 

1 

.5 

.5 

.25 

3 

4 

4 

4 

8 

1 

.5 

.5 

..25 

.25 

4 

1 

2 

1 

4 

.5 

.5 

.25 

.25 

.25 

5 

4 

4 

4 

8 

1 

.5 

.5 

.25 

.12 

6 

4 

4 

4 

8 

1 

.5 

.5 

.25 

.50 

TABLE  2 
Prior  Parameters  Used  in  Calculations  of  Tables  3-6 


4.2      DISCRETE  MODEL 

For  the  discrete  model,  numerical  calculations  become  simplified  when 
the  prior  probability  density  functions  for  the  failure  probabilities  u  and  v 
are  of  the  Beta  family  of  p  0d  0f  „'s  ,  where 


B(x;a,  B)    = 


ne) 


r(a)ro-a) 


ot-1,,      \  B-arl 
x        (1-x) 


(73) 
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<// 


The  moments  of  this  function  are 
E(x)    =    a/ 6 
V(x)    -    a2(x)    =    f  <i  "  f  >  iTI  <74> 

Unfortunately,  even  this  usually  "conjugate  prior"  form  does  not  allow 
a  closed  form  solution  of  the  projection  problem,  as  exemplified  in  equations 
(55)  and  (56)  .    This  is  not  to  say  that  specific  projections  cannot  be  made  -- 
the  associated  numerical  integrations  are  straightforward,  but  have  not  been 
attempted  here  . 

The  more  interesting  inferential  problem  may  be  easily  evaluated,  how- 
ever, and  is  illustrated  in  Tables  8  through  12  . 

The  data  vector  is  assumed  to  be 

x~=    (0,1,0,1,0,1,0,1,1,1,0,1,1,1,0,1,1,1,0,1,1,1,0) 
where  a  "0"  represents  a  failure,  a  "1"  represents  a  success.    Again, 
this  "observed"  data  vector  has  been  pre-selected  to  simulate  an  overly 
typical  result  that  might  appear  if    u  =   .5     v  =  .25    and  repair  took  place 
on  the  7th  trial  (the  4th  failure)  .    Numerical  results  now  simply  require  a 
set  of  prior  parameters  and  the  determination  of  the  first  and  second  mo- 
ments of  equations  (61)  ,   (62)  and  (63)  . 

In  the  calculation  of  a  number  of  cases  for  various  values  of  prior 
parameters,  it  becomes  convenient  to  work  with  the  success  probabilities 
1-u    and    1-v,    rather  than    u    and    v    directly.    Table  7  shows  the  selection 
of  values  of  the  prior  parameters  for    1-u    and    1-v,    and  for  the  repair 
probability    a  „ 
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Table 

E(l-u) 

gt(I-u) 

E(l-v) 

a(l-v) 

a 

8 

.5 

,2887 

.75 

.3660 

.25 

9 

..5 

.3536 

.75 

.3953 

.25 

10 

.4 

.2619 

.6 

.4 

.25 

11 

.5 

.3536 

.75 

.3953 

.125 

12 

.5 

.3536 

.75 

.3953 

.5 

TABLE  7 
Prior  Parameters  Used  in  Calculation  of  Tables  8-12 


5  .      MANY  FAILURE  MODES 
5  . 1      NOTATIONAL  EXTENSION 

In  order  to  treat  the  more  realistic  case  of  systems  with  multiple  failure 
modes,  we  introduce  a  simple  extended  model  and  notation,  and  then  show 
that  this  case  is  solved  formally  by  a  simple  extension  of  previously  ob- 
tained solutions  .    The  development  will  be  only  for  the  continuous  model, 
although  a  similar  one  for  the  discrete  case  can  be  directly  obtained  by 
means  of  a  parallel  analysis  . 

We  now  assume  that  a  system  can  exhibit  a  total  of    M    independent 
failure  modes  (characterized,  by  definition,  by  their  distinguishability)  . 
We  also  assume  that  a  repair  of  a  mode  is  possible  only  at  a  repair  attempt 
made  after  an  observed  failure  of  that  mode  . 

We  then  define,  for  mode    i    (i  =  1 ,  2  ,    .  .  .  M)  , 
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— .  CDP"-ir\oror^mcDr-c^cDf^-aoco(N^-<ocDOf^»J-»-<fM 

^Q;  fNifOCO<0^-,©^<0«0>0>00^'0«0»0'Oinm«J-«*,^fO 


in  — <  >r  tr  cr  >r  w  ro  mm  r—  tr  co  c\i  ^-nj  o  >r  r-  o  cr>  in  eg 
r^u^^ij^u>cuiurn'-*>J"^'-«u,'><J'TU,f'nNt>ir',ninujo> 
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^f 


X.    =    failure  rate  when  i      mode  is  unrepaired 

|i .   =    failure  rate  when  i      mode  is  repaired 

a     =    probability  of  repairing  the  i      mode  given  an  attempt  is  made 

The  entire  system  will  have  an  overall  failure  rate    r,    which,  by  virtue 

of  the  exponential  failure  behavior  of  each  component,  is 

M 
r  =     r  r. 

i=l   X 


where 


X.  i      mode  is  unrepaired 


H.  i      mode  is  repaired 

This  last  expression  serves  to  recall  that, according  to  our  previous  analysis, 

the  failure  rates  are  in  themselves  random  variables  . 

If,  then,  the  failure  rate  for  each  mode  is  a  random  variable    r.,    with 

- 1 

known  p.d.f.    f     (r. )     [and  thus  known  moments],  we  have  in  particular  for 

— i 
the  overall  system 

£r(r)    "    fr    <rl'*    £r'r2'    *   ■■■irJ<M>  (75) 

-  -1  ~2  ~M 

where  the  *  indicates  the  convolution  operation  „ 

Because  of  the  independence  of  the  failure  modes,  and  since  the  repair 

of  any  one  mode  is  independent  of  the  state  of  the  others,  we  see  that  each 

of  the    f     (r.)    of  equation  (75)  is  available  from  expressions  such  as  (31) 
— i 

[for  projection]  or  (40)   [for  inference]  .    In  these  expressions  we  must  only 
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replace  the  parameters    (r,   X,  \i ,  a)    by    (r, ,   X. ,  |i , ,  a. )  ,  and  note  that    t 

now  represents  the  times  of  occurrences  of  i      mode  failures  . 

To  make  matters  even  simpler  for  practical  purposes,  we  note  that  since 

X  =  £_F_.   >  anc*  tne    r-    are  independent,  we  can  immediately  write  for  the 

expectation  and  variances: 

M 
E(r)    =     L  E(r   ) 
i=l 

2  M     2 

V(r)    =    a    (L)    =      2  CT    (r.) 

i=l  X 


6 .      CONCLUSION 
6  . 1      OTHER  MODELS  OF  RELIABILITY  GROWTH 

Discussion  of  the  literature  on  reliability  growth  models  has  been 
intentionally  postponed  to  this  final  section  in  order  to  facilitate  compari- 
son with  this  paper . 

The  subject  of  reliability  improvement  by  means  of  conscious  efforts  on 
the  part  of  designers,  test  engineers,  customers,  etc.  has  been  of  interest 
from  the  beginnings  of  reliability  analysis  „    The  modelling  of  such  growth 
processes  has  followed,  for  the  most  part,  a  common  procedure:    formulae 
are  presented  that  are  intended  to  represent  the  growth  of  reliability  (or  the 
decrease  in  failure  rate,  etc.)  as  a  function  of  time.    These  formulae  con- 
tain unknown  parameters,  and  it  becomes  a  statistical  problem  to  find  appro- 
priate estimates  (and  confidence  statements)  for  these  parameters  as  a 
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function  of  observed  failure  data.    Such  methods  are  found,  for  example,  in 
references  [10],  [3],   [15]  and. [8]  .    Sherman  [14],  for  example,  finds 
Maximum  Likelihood  Estimates  for  the  repair  probability    a    and  the  unre- 
paired failure  probability    u    when  it  is  assumed  that  the  repaired  failure 
probability    v    is  zero  „ 

Another  approach  is  to  assume  that  little  is  known  about  the  underlying 
failure  behavior  of  the  system,  and  what  amounts  to  "almost"  non-parametric 
analysis  is  made  upon  eventual  failure  rates  (or  probabilities)  .    This  is 
summarized  in  [1]  . 

Bayesian  techniques  have  been  used  only  recently.    A  non-parametric 
Bayesian  analysis  of  a  failure  probability,  constrained  to  be  only  non- 
increasing  in  time,  may  be  modelled  by  the  technique  shown  in  Samuels  [13] 
Larson  [9]  has  extended  an  earlier  analysis  [8]  to  produce  Bayesian  esti- 
mates of  parameters  of  a  growth  model,  using  prior  distributions  suggested 
by  Earnest  [5]  .    Finally,  Cozzolino  [4]  has  presented  a  Bayesian  approach 
to  a  general  class  of  growth  models  with  regard  to  making  minimum-cost 
decisions  about  length  of  tests  and  burn-in  procedures  . 

All  of  the  above  analyses,  however,  start  with  a  basic  assumption: 
that  the  reliability  will  grow  (or,  at  least,  will  not  decrease)  in  time.    If 
the  techniques  derived  previously  were  to  be  used  for  a  system  that  was 
actually  deteriorating  (naturally,  or  because  of  well-intentioned  intervention)  , 
the  results  would  be  meaningless  .    In  practice,  unfortunately,  there  is  often 
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a  need  to  have  an  inferential  technique  that  would  spot  such  deterioration, 
as  well  as  one  equally  good  at  determining  appropriate  growth  character- 
istics . 

6o2      CONCLUSION 

This  paper  has  attempted  to  model  a  process  that  simply  considers  a 
system  (with  regard  to  each  failure  mode)  to  be  in  either  a  repaired  or  un- 
repaired state  o    The  failure  rates  in  each  state  are  known  to  any  desired 
degree  of  confidence,  and  accumulation  of  failure  data  serves,  in  a  natural 
way,  to  update  the  knowledge  of  these  state  parameters  .    The  observation 
of  failure  data  also  determines  the  probability  that  the  system  is  repaired 
(with  respect  to  each  mode)  . 

The  weakest  points  of  the  model  seem  to  be  the  assumptions  that 
o  The  repair  probability    a    is  known 
.  Repair  attempts  occur  only  after  the  observation  of  a  failure 

The  first  point  can  be  overcome  (at  the  expense  of  additional  com- 
plexity) by  considering    a    to  be  a  random  variable    a_  with  appropriate 

prior  p„d„f  „    f  (a|H)  .    All  analysis  would  then  include  a  posterior  infer- 
os 

ential  p„d„f0  for    a_,    given  a  data  vector. 

The  second  point  is  unfortunately  too  much  at  the  heart  of  the  model . 
For  many  realistic  systems,  the  assumption  seems  to  be  valid,  however, 
as  the  tendency  is  not  to  "ruin  a  good  thing"  . 
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It  should  be  pointed  out  that  the  model  considered  here  is  a  specific 
example  of  a  process  which  Howard  [6]  calls  "Dynamic  Inference"  .    This 
general  concept  is  quite  useful  in  modelling  a  stochastic  process  in  which 
the  underlying  parameters  are  allowed  to  change  according  to  yet  another 
stochastic  process  .    The  interested  reader  is  referred  to  reference  [6], 
where  (as  becomes  apparent  upon  studying  the  Tables  2-6  and  8-12)  the 
statement  is  made,   "The  numerical  results  indicate  a  complexity  of  behavior 
that  challenges  intuition"  . 
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